FM-WAP Mining: In Search of Frequent Mutating Web Access Patterns from Historical Web Usage Data
نویسندگان
چکیده
Recently, a large amount of work has been done in web access pattern (WAP) mining. Most of the existing techniques focus on mining WAP that occur frequently from snapshot web usage data collection. However, web usage data is dynamic in real life. The dynamic nature of web usage data leads to two challenging problems. The first problem is the maintenance of existing WAP mining results. The second problem is discovering hidden knowledge from the historical changes to WAP. As the frequent WAPs are useful in many applications, knowledge hidden behind the historical changes to WAPs, which reflects how WAPs change, is also critical to some applications such as adaptive web, web site maintenance, business intelligence, etc. In this paper, we propose a novel approach to discover hidden knowledge from historical changes to WAPs. We define a new type of knowledge, Frequent Mutating WAP (FM-WAP), based on the historical changes to WAPs. Moreover, a FM-WAP mining approach, which can discover all the FM-WAPs efficiently from historical web usage data, is presented. Firstly, the historical web usage data collection is partitioned into different groups according to the user-defined calendar pattern, where each group is represented as a WAP forest. Then, changes among the WAP forests sequence are detected and stored in the global forest. Lastly, the FM-WAP is extracted by traversing the global forest. Extensive experimental results show that our proposed approach is efficient with good scalability to very large web usage data collection. Moreover, this approach can produce new knowledge about web access patterns that cannot be discovered by existing techniques.
منابع مشابه
Mining of Users’ Access Behaviour for Frequent Sequential Pattern from Web Logs
Sequential Pattern mining is the process of applying data mining techniques to a sequential database for the purposes of discovering the correlation relationships that exist among an ordered list of events. The task of discovering frequent sequences is challenging, because the algorithm needs to process a combinatorially explosive number of possible sequences. Discovering hidden information fro...
متن کاملHigh Fuzzy Utility Based Frequent Patterns Mining Approach for Mobile Web Services Sequences
Nowadays high fuzzy utility based pattern mining is an emerging topic in data mining. It refers to discover all patterns having a high utility meeting a user-specified minimum high utility threshold. It comprises extracting patterns which are highly accessed in mobile web service sequences. Different from the traditional fuzzy approach, high fuzzy utility mining considers not only counts of mob...
متن کاملConcurrency in Web Access Patterns Mining
Abstract—Web usage mining is an interesting application of data mining which provides insight into customer behaviour on the Internet. An important technique to discover user access and navigation trails is based on sequential patterns mining. One of the key challenges for web access patterns mining is tackling the problem of mining richly structured patterns. This paper proposes a novel model ...
متن کاملKnowledge Discovery from Web Usage Data: Research and Development of Web Access Pattern Tree Based Sequential Pattern Mining Techniques: A Survey
Sequential pattern mining is the process of applying data mining techniques to a sequential database, to extract frequent subsequences to discover correlation that exists among the ordered list of events. Web Usage mining (WUM) discovers and extracts interesting knowledge/patterns from Web logs is one of the applications of Sequential Pattern Mining. In this paper, we present a survey of the se...
متن کاملUse of Semantic Similarity and Web Usage Mining to Alleviate the Drawbacks of User-Based Collaborative Filtering Recommender Systems
One of the most famous methods for recommendation is user-based Collaborative Filtering (CF). This system compares active user’s items rating with historical rating records of other users to find similar users and recommending items which seems interesting to these similar users and have not been rated by the active user. As a way of computing recommendations, the ultimate goal of the user-ba...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2004